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BACKGROUND OF THE INVENTION 

1. The Field of the Invention 

[0001] The present invention relates to the field of video processing. In particular, the 
present invention relates to the compression of a video stream when it is known that the 
video stream is to be subsampled for minimal loss after subsampled decoding. 

2. Background and Relevant Art 

[0002] Video constitutes a series of images that, when displayed above a certain rate, 
gives the illusion to a human viewer that the image is moving. Video is now a widespread 
medium for communicating information whether it be a television broadcast, a taped 
program, or the like. More recently, digital video has become popular. 
[0003] An uncompressed digital video stream has high bandwidth and storage 
requirements. For example, the raw storage requirement for uncompressed CCIR-601 
resolution 4:2:2: serial digital video is approximately 20 megabytes per second of video. In 
addition, associated audio and data channels also require bandwidth and storage. From a 
transmission bandwidth perspective, 20 megabytes per second is much faster than 
conventional transmission techniques can practicably support. In addition, from a storage 
perspective, a two-hour movie would occupy approximately 144 Gigabytes of memory, well 
above the capabilities of a conventional Digital Versatile Disk (DVD). Therefore, what 
were desired were systems and methods for compressing (or coding) digital video in a way 
that maintains a relatively high degree of fidelity with the original video once uncompressed 
(or decoded). 

[0004] One conventional high-quality compression standard is called MPEG-2, which is 
based on the principle that there is a large degree of visual redundancy in video streams. By 
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removing much of the redundant information, the video storage and bandwidth requirements 
are significantly reduced. 

[0005] Figure 1A illustrates a display order 100A of a sequence of pictures. If the video 
stream represents progressive video, the pictures represent individual progressive frames. If 
the video steam represents interlaced video, the pictures represent individual interlaced 
frames containing two fields each. 

[0006] Under the MPEG-2 standard, there are three classes of pictures, I-pictures, P- 
pictures and B-pictures. While MPEG-2 allows for a number of display orders for groups of 
pictures, the display order illustrated in Figure 1A is commonly used. In this common 
display order, there are a series of I-pictures. For clarity, only I-pictures Ii and Ii6 are shown 
in Figure 1A. Each consecutive I-picture pair has four P-pictures interspersed there 
between. For example, P-pictures P 4 , P7, Pio and Pn are interspersed between consecutive I- 
pictures Ii and Ii 6 - In addition, two B-pictures are interspersed between each I-picture and 
each of its neighboring P-pictures. Two B-pictures are also interspersed between each 
consecutive P-picture pair. For example, B-pictures B2 and B3 are interspersed between I- 
picture Ii and P-picture B4, B-pictures B 5 and Be are interspersed between P-pictures P4 and 
P 7 , B-pictures B 8 and B 9 are interspersed between P-pictures P 7 and P10, B-pictures B n and 
B12 are interspersed between P-pictures P10 and P13, and B-pictures B14 and B15 are 
interspersed between P-picture P13 and I-picture Ii6- 

[0007] The I-pictures are "intra-coded" meaning that they can be restructured without 
reference to any other picture in the video stream. 

[0008] The P-pictures are "inter-coded" meaning that they may only be restructured with 
reference to another reference picture. Typically, the P-picture may include motion vectors 
that represent estimated motion with respect to the reference picture. The P-picture may be 
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reconstructed using the immediately preceding I-picture or P-picture as a reference. In 
Figure 1 A, arrows illustrate the predictive relationship between pictures wherein the picture 
at the head of the arrow indicates the predictive picture, and the picture at the tail of the 
arrow indicates the reference picture used to reconstruct the predictive picture. For example, 
the reconstruction of P-picture P7 uses P-picture P4 as a reference. 

[0009] B-pictures are also inter-coded. The B-picture is typically reconstructed using 
the immediately preceding I-picture or P-picture as a reference, and the immediately 
subsequent I-picture or P-picture as a reference. For example, the reconstruction of B- 
picture B14 uses P-picture P13 and I-picture lis as references. 

[0010] Figure IB illustrates the decode order 100B of the pictures. The decode order is 
similar to the display order except that reference frames are decoded prior to any predictive 
pictures that rely on the reference picture, even if the reference picture is displayed after the 
predictive picture. Thus, the arrows in Figure IB are all rightward facing. 
[0011] Figure 2 A illustrates the general process involved with encoding a digital picture 
201 using an encoder 200 A that is compatible with the MPEG-2 standard. If the digital 
picture is to be an I-picture, the digital picture bypasses the motion estimator 202 and is 
provided to the discrete cosine transformation unit (DCT) 203, which transforms the digital 
picture, on a block-by-block basis from a spatial representation of an image to a frequency 
representation of the image. The frequency representation is then passed to a quantization 
unit 204, which quantizes each frequency, on a macroblock-by-macroblock basis, into 
definable ranges. A "macroblock" is a 16-pixel by 16-pixel array within the picture. The 
quantized image is then passed to a variable length coder 205 which performs, for example, 
variable length Huffman coding on the resulting quantized image. The reduced sized I- 
picture is then stored or transmitted for subsequent decoding. 
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[0012] If the digital picture 201 is to be a P-picture, the encoding process is similar as 
for I-pictures with several notable exceptions. If a P-picture, the digital picture is passed 
first to the motion estimator 202. For each macroblock (i.e., 16x16 pixel array) in the P- 
picture, the motion estimator 202 finds a close match to the macroblock in the reference 
picture. The motion estimator 202 then represents the macroblock in the P-picture as a 
motion vector representing the motion between the macroblock in the P-picture and the 
close match 16x16 pixel array in the reference picture. In addition to the motion vector, a 
difference macroblock is calculated representing the difference between the macroblock in 
the P-picture and the close match 16x16 pixel array in the reference frame. A macroblock 
represented as a difference with corresponding motion vectors is typically smaller than a 
macroblock represented without motion vectors. Discrete cosine transformation and 
quantization are then performed on just the difference representation of the P-picture. Then, 
the difference information is combined with the motion vectors before variable length 
coding is performed. 

[0013] B-pictures are encoded similar to how P-pictures are encoded, except that motion 
may be estimated with reference to a prior reference picture and a subsequent reference 
picture. 

[0014] Figure 2B illustrates a conventional decoder 200B in conformance with the 
MPEG-2 standard. First, a variable length decoder 215 performs, for example, variable 
length decoding on the picture. The picture (or the difference data of the picture if a P- 
picture or a B-picture) is passed to the inverse quantizor 214 for inverse quantization on a 
macroblock-by-macroblock basis. Next, an inverse discrete cosine transformer 213 
performs inverse discrete cosine transformation on the frequency representation of the 
picture, on a block-by-block basis, to reconstruct the spatial representation of the picture. 
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The spatial representation of the picture is passed to the motion compensator 212 where the 
spatial representation is combined with the motion vectors (if a P-picture or B-picture) to 
thereby reconstruct the digital picture 201'. The reconstructed digital picture 201' is labeled 
differently than the original picture 201 to represent that there may be some loss in the 
encoding process. 

[0015] In this manner, MPEG-2 combines the functionality of motion compensation, 
discrete cosine transformation, quantization, and variable length coding to significantly 
reduce the size of a video stream with some generally acceptable reduction in video quality. 
Despite conventional standards such as MPEG-2 that provide significant compression to a 
video stream, it is desirable to reduce the bandwidth requirements of the video stream even 
more to maximize network and storage performance. 

[0016] One way to further reduce the bandwidth requirements is to compress the video 
stream even beyond the compression performed during the original MPEG-2 encoding 
processes. However, this results in a loss of video information and thus degrades the quality 
of the video stream to a certain extent. Therefore, what are desired are systems and methods 
for further compressing a video stream with less, if any, loss of video information. 
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BRIEF SUMMARY OF THE INVENTION 
[0017] The present invention extends to both methods and systems for transcoding a 
video stream so as to reduce the size of the video stream with little, if any, degradation of 
video quality after subsampling. The video steam includes a number of video pictures such 
as frames or fields and may be stored in memory or accessed from a transmission. In 
addition, each video picture includes one or more blocks. These blocks are the fundamental 
unit upon which subsampling may be performed. For example, under the MPEG-2 standard, 
developed by the Moving Pictures Experts Group, subsampling may be performed on blocks 
of 8 pixels by 8 pixels. 

tjg [0018] The video management system accesses one of the video pictures from the video 

|B stream. Then, for at least one block of the video picture, the video management system 

m 

$j represents the block as a matrix of pixel values. Then, the block matrix is pre-multiplied by 

W a pre-multiplication matrix and post-multiplied by a post-multiplication matrix. The pre- 

si 

n 

^ multiplication matrix is generated from a subsample matrix that represents the subsampled 

decoding in one direction. The post-multiplication matrix is generated from a subsample 
^ |dt matrix that represents the subsampled decoding in a substantially perpendicular direction. 

W 

^ z E [0019] The pre-multiplication matrix and the post-multiplication matrix are structured so 
^ p > 1 3 s 

g|pgp| that the block of pixels is altered in a manner that subsampling of the altered block of pixels 

q | § a 8 o results in the same subsampled image as subsampling of the original block of pixels. The 

£ i < § S ^ pre-multiplication matrix and the post-multiplication matrix are also designed to decrease 

2 the size of the encoded version of the block of pixels. 

O 

^ [0020] This strategic altering of blocks of pixels may be repeated for each block in the 

video picture and for each video picture in the video stream that is to be subject to 
subsampled decoding. Accordingly, the memory and bandwidth requirements of the video 
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stream may be substantially reduced with the satisfaction that the reduction comes at 
minimal cost in video quality assuming that the video stream is to ultimately be subsample 
decoded. In one aspect of the invention, the further compressed video stream is sent to a 
subsample decoder where it is subsampled and presented on a display device. 
[0021] Additional features and advantages of the invention will be set forth in the 
description which follows, and in part will be obvious from the description, or may be 
learned by the practice of the invention. The features and advantages of the invention may 
be realized and obtained by means of the instruments and combinations particularly pointed 
out in the appended claims. These and other features of the present invention will become 
more fully apparent from the following description and appended claims, or may be learned 
by the practice of the invention as set forth hereinafter. 



- Page 8 - 



Docket No. 14531.114 




BRIEF DESCRIPTION OF THE DRAWINGS 
[0022] In order to describe the manner in which the above-recited and other advantages 
and features of the invention can be obtained, a more particular description of the invention 
briefly described above will be rendered by reference to specific embodiments thereof, 
which are illustrated in the appended drawings. Understanding that these drawings depict 
only typical embodiments of the invention and are not therefore to be considered to be 
limiting of its scope, the invention will be described and explained with additional 
specificity and detail through the use of the accompanying drawings in which: 
[0023] Figure 1A illustrates a display order of an MPEG-2 video stream in accordance 
. J with the prior art; 

!H [0024] Figure IB illustrates a decode order of an MPEG-2 video stream in accordance 

y t 

\T* with the prior art; 

i '"" [0025] Figure 2 A illustrates an encode sequence in accordance with MPEG-2 and in 

accordance with the prior art; 

[0026] Figure 2B illustrates a decode sequence in accordance with MPEG-2 and in 
accordance with the prior art; 

[0027] Figure 3 schematically illustrates a video network in which the principles of the 
present invention may operate; and 

[0028] Figure 4 is a flowchart of a method for transcoding a video stream so that there is 
little, if any, loss in video quality after subsampling in accordance with the present 
invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 



[0029] Subsampling is a process that reduces the dimensions of a video image such as 
when the video stream is to be displayed in a reduced-size picture-in-picture display. The 
present invention extends to both methods and systems for reducing the size of the video 
stream with minimal, if any, effect on the video quality as displayed after subsampling. A 
video management system accesses a video stream by receiving the video stream from a 
video channel, or by accessing a memory where the video stream is stored. Once the video 
management system determines that only a reduced-size version of the video stream is 
ultimately to be displayed as when the video stream is to be subject to subsampling, the 
video management system compresses each picture (e.g., frame or field) of the video frame. 
Although this compression would cause loss of picture quality if the picture were to be 
displayed in its full size, this compression is performed in such a manner that there is little, 
if -any, loss in video quality as displayed after subsampling. Any loss in video quality would 
be primarily due to re-quantization, and finite-precision effects inherent in computer 
processing. 

[0030] Embodiments within the scope of the present invention include computer- 
readable media for carrying or having computer-executable instructions or data structures 
stored thereon. Such computer-readable media can be any available media that can be 
accessed by a general purpose or special purpose computer. By way of example, and not 
limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM 
or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any 
other medium which can be used to carry or store desired program code means in the form 
of computer-executable instructions or data structures and which can be accessed by a 
general purpose or special purpose computer. 
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[0031] When information is transferred or provided over a network or another 
communications connection (either hardwired, wireless, or a combination of hardwired or 
wireless) to a computer, the computer properly views the connection as a computer-readable 
medium. Thus, any such connection is properly termed a computer-readable medium. 
Combinations of the above should also be included within the scope of computer-readable 
media. Computer-executable instructions comprise, for example, instructions and data 
which cause a general purpose computer, special purpose computer, or special purpose 
processing device to perform a certain function or group of functions. 
[0032] The precise operating environment in which the principles of the present 
invention are implemented is not important to the present invention. The principles of the 
present invention may be implemented in any operating environment that is able to 
implement the principles of the present invention. For example, given suitable software 
and/or adaptation, general-purpose computers, special-purpose computers or special purpose 
processing devices (whether now developed or to be developed in the future) might 
implement the principles of the present invention. In addition, the principles of the present 
invention may be implemented by software, hardware, firmware or any combination thereof. 
[0033] As will be described in further detail below, the principles of the present 
invention are most advantageous when the video processing in accordance with the present 
invention is followed by subsampling. The environment discussed below with respect to 
Figure 3 illustrates just one example of an environment in which subsampling is performed 
and is provided for illustrative purposes only, and not for purposes of limiting the claims. 
One of ordinary skill in the art will easily recognize that the principles of the present 
invention may be implemented in any environment where the video processed in accordance 
with the present invention is to be subsampled. 
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[0034] Figure 3 and the corresponding discussion provide a general description of a 
network 300 in which the present invention may operate. The network 300 includes a 
management system 310 that receives video input 301, performs appropriate processing on 
the video, and then distributes the video. The video is distributed either directly to a display 
device 31 1 or else to a video node such as one of video nodes 320 through 323, where the 
video may be subject to further processing, including perhaps subsampling, before being 
distributed to the corresponding display device 330 through 333. For illustrative purposes, 
four video nodes are shown although the management system 310 may work with other 
numbers of video nodes. The management system 310 need not just perform video 
processing. For example, the management system 310 may also communicate non- video 
information with networks 303 over link 302 and process the non-video information as well. 
[0035] The video management system 310 includes a memory 341 that may store the 
computer-executable instructions described above, and a processor 342 that is coupled to the 
memory 341 through, for example, a bus 343 so as to be able to execute the computer- 
executable instructions. The video management system 310 also includes a video decoder 
344 that decodes video in accordance with a video decoding standard such as, for example, 
MPEG. A transcoder 345 operates to reduce the memory and bandwidth requirements of the 
video 302 and may do such by implementing the principles of the present invention 
described herein. If the video decoder 344 and the transcoder 345 are implemented at least 
partially in hardware, the video decoder 344 and the transcoder 345 would be coupled to the 
bus 343 as shown in Figure 3. However, as will be apparent to those of ordinary skill in the 
art, the principles of the present invention may be implemented by hardware, software, or a 
combination of hardware and software. 
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[0036] While Figure 3 and the corresponding discussion above provide a general 
description of a suitable environment in which the invention may be implemented, it will be 
appreciated that the features of the present invention disclosed herein may be practiced in 
association with a variety of different system configurations. 

[0037] Figure 4 illustrates a method 400 for transcoding a video stream so as to reduce 
the size of the video picture with minimal, if any, effect on the video quality after 
subsampling. In the context of Figure 3, the video management system 310 may perform 
this transcoding at a given subsampling ratio and then provide the transcoded video stream 
to one of the video nodes (e.g., video node 320) over video network 300. The video node 
320 would then perform subsampled decoding using the same subsampling ratio. 
[0038] First, the video stream is accessed (act 401). Then, for at least one of the blocks 
in a video picture of the video stream, the size of the encoded block is reduced without 
substantially reducing image quality as measure after subsampling (act 402). Preferably, all 
of the blocks in all of the video pictures in the video stream are compressed when it is 
known that the video stream is to be ultimately subject to subsampled decoding. The 
processing of an already encoded video stream (or any already encoded data component for 
that matter) into a different encoded video stream is often referred to as "transcoding" since 
the video stream is moved from one encoded state to another. The compression of the 
encoded video stream to generated a more compressed video stream with minimal, if any, 
loss in video quality after subsampling is one example of transcoding and will be referred to 
herein as "subsampled transcoding" although subsampled decoding is still needed after the 
subsampled transcoding in order to display the reduced size image. In the context of Figure 
3, for example, this subsampled transcoding may be performed by transcoder 345. 
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[0039] In order to generate the reduced size blocks, the block of pixels is represented as 
a matrix (act 403). The block of pixels may either be represented by a "spatial domain" 
matrix or by a "transform domain" matrix. A spatial domain matrix of a block of pixels 
means that the element values of the matrix are specific pixel values that are laid out 
spatially in the matrix according to the position of the corresponding pixel in the block. 
Thus, the element in row 3, column 2 of the spatial domain matrix represents a pixel value 
corresponding to a pixel in row 3, column 2 of the block of pixels. A transform domain 
matrix of a block of pixels is less intuitive and is represented by performing a transform on 
the spatial domain matrix. Each element in the transform domain matrix represents a 
discrete transform relationship between the pixel values in the spatial domain matrix. For 
example, if the transform domain matrix is a frequency domain matrix, one common 
discrete frequency relationship is defined by the well-known Discrete Cosine Transform 
(DCT) operation. Before describing a suitable pre-multiplication matrix and post- 
multiplication matrix that are suitable for acts 404 and 405, respectively, the mathematical 
relationship between transform domain matrices and spatial domain matrices will now be 
briefly described followed by a mathematical description of how subsampling typically 
occurs. 

[0040] A transform domain matrix A may be generated by performing pre-multiplication 
and post-multiplication on a corresponding spatial domain matrix P. This operation is 
represented in matrix form by the following equation 1 : 

A = DxPxE (1) 



where, 
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P is the spatial domain matrix corresponding to the transform domain matrix^; 
A is the transform domain matrix corresponding to the spatial domain matrix P; 
D is the transform matrix for the vertical direction; and 
E is the transform matrix for the horizontal direction. 

[0041] If the spatial domain matrix P is, for example, an 8-by-8 matrix where each 
element represents a pixel component, the matrices D, E and A are also 8-by-8 matrices. 
There is no requirement that the matrices D and E be unitary or symmetric. Also there is no 
requirement that the D and E represent the same transform. In one case, D could represent a 
Discrete Cosine Transform (DCT) matrix in the vertical direction, while E represents D 
transpose (i.e., the DCT matrix in the horizontal direction). However, in another example D 
could represent a wavelet transform matrix in the vertical direction, while D represents the 
DCT matrix in the horizontal direction. 

[0042] Conversely, the spatial domain matrix P may be generated from a transform 
domain matrix by performing an inverse matrix transform on the transform domain matrix. 
This inverse operation is represented in matrix form by the following equation 2: 

P = D~ l xAxE- } (2) 

[0043] Subsampling of the spatial domain matrix P occurs by pre-multiplying the matrix 
P by a subsampling matrix that defines the subsampling in one direction such as when 
performing horizontal subsampling. The resulting subsampled matrix may then be post- 
multiplied by the transpose of another subsampling matrix that defines the subsampling in a 
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substantially perpendicular direction as when performing vertical subsampling. This 
subsampling is performed on the spatial domain matrix P as illustrated by the following 
equation 3: 



p=SxPxT (3) 
where, 

p is the subsampled spatial domain matrix of the spatial domain matrix P\ 
S is the subsampling matrix that is used for horizontal subsampling; and 
T' is the transpose of the subsampling matrix T that is used for vertical 
subsampling. 

[0044] Rewriting equation 3 by substituting the value of matrix P from equation 2 
results in the following equation 4: 

p=SxD~ x xAxE~ x xT (4) 

[0045] Each of these matrices may conceptually be split into four separate quadrants 
based on the subsample size. For example, the matrix A may be rewritten as the following 
equation 5: 



A A 

, BL BR . 



(5) 
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where, 

Atl is a matrix component having a size that is proportional to the matrix A by 
the same ratio as the proportion of the subsampled picture to the original 
picture; 

Atr is a matrix component that resides to the right of the matrix An\ 
Abl is a matrix component that resides below the matrix Atl\ and 
A B R is a matrix component that resides below the matrix A T r and to the right of 
the matrix Am. 

[0046] Similarly, the top two components of the matrix may be combined and the 
bottom two components may be combined so that the matrix A is defined as in the following 
equation 6: 



A = 



Aj- 



(6) 



where, 

At is a matrix component defined by the combination of Atl and Atr\ and 
Ab is a matrix component defined by the combination of Abl and Abr- 

[0047] For instance, if the matrix A is an 8 row by 8 column matrix, and the 
subsampling cuts each dimension size (horizontal and vertical) in half, the matrix 
component Atl would be a 4 row by 4 column matrix. Consequently, the other matrix 
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components Atr, Abu and Abr would also be 4 row by 4 column matrices. In this case, the 
matrix components At and As would each be 4 rows by 8 columns. This subsample ratio is 
used as an example in the following description although the present invention works with 
.other subsampling ratios. For example, if the picture were subsampled by 75%, each 8 row 
by 8 column matrix would be reduced to a mere 2 row by 2 column matrix. In this latter 
case, the matrix component An would be a 2 row by 2 column matrix. Consequently, the 
matrix components Atr would be a 2 row by 6 column matrix, the matrix component Abl 
would be a 6 row by 2 column matrix, and the matrix component A B r would be a 6 row by 6 
column matrix. In this case, the matrix component A T would be two rows by eight columns 
and the matrix component Ab would be 6 rows by 8 columns. Subsampling may also occur 
in just one direction, horizontal or vertical, with no subsampling occurring in the other 
direction. 

[0048] Thus, the size of the matrix components is important for the transcoder to know 
when performing the subsample transcoding in accordance with the present invention since 
the subsampling ratio used to perform subsample transcoding by the transcoder 345 should 
be the same as the subsampling ratio used to perform subsampled decoding at the video 
node 320. This knowledge may be inferred by the transcoder 440. For example, if the video 
stream corresponds to the reduced-size image of a picture-in-picture display, the reduced 
size image may always be a certain size (e.g., half the size for each dimension). Thus, the 
transcoder 440 may infer that if subsampled decoding is to occur at all, it is at a subsampling 
ratio of 50% in each direction. 

[0049] Referring to Figure 3, the video node (e.g., video node 320) that is to display the 
reduced size image informs the management system 310 via network 300 the identity of any 
channel for which a reduced size video image is desired. If the transcoder 345 cannot 
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accurately infer the subsampling ratio (e.g., a user may be able to adjust the size of the 
picture-in-picture image), the video node 320 also informs the video management system of 
the appropriate subsampling ratio that is to be performed at the video node. 
[0050] In accordance with the principles of the present invention, the transform domain 
matrix A is converted into a matrix a that has zero values in all but its upper left component 
an- Specifically, matrix a may be represented by the following equation 7. 



\3 

E , ? 



a = 



a Tl Z 
Z Z 



(7) 



^3 where, 

Z represents matrix components having zero values for all elements. 



p [0051] Since the matrix a has many zero values, coding methods such as Huffman 

m 

H variable length coding reduce the coded representation of the matrix a significantly as 

lB 

j2 P compared to the coded representation of the matrix A. Thus, the size of the coded video 

S - stream is significantly reduced when converting matrix A to the matrix a for each block in 

u3 o 3 g 1 5 each picture of the video stream. 

q * I a I u [0052] In accordance with the principles of the present invention, the matrix A is 



converted into the matrix a in such a manner that subsampled decoding of the matrix a 
results in the same pixel block (i.e., matrix p defined by equation 4) as subsampled decoding 



% the matrix A, Specifically, the following equation 8 holds true: 



SxD~ ] xAxE' ] xT^SxD-' xaxE' 1 xT 
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[0053] As mentioned above, the matrix a only has the potential for non-zero elements in 
its upper left matrix component a T L- Thus, once one determines what the matrix component 
an should be, one has also determined what the matrix a should be. The inventors have 
discovered that an appropriate matrix component an that will cause equation 8 to be 
satisfied so that subsampled decoding of the matrix a results in substantially the same 
picture as subsampled decoding of the matrix A is defined by the following equation 9: 



m 
m 

I . £ . 



a TL =(ml)~ x xSxPx T^/tl)' 1 (9) 

[0054] In equation 9, the matrix {ml)' 1 x S represents an example of the pre- 
multiplication matrix that the block matrix P is pre-multiplied by in act 404 of Figure 4. The 
matrix T x (nlj 1 represents an example of a post-multiplication matrix that the block matrix 
;j5 P is post-multiplied by in act 405 of Figure 4. 

jig [0055] In equation 9, the matrix P represents the spatial domain representation of a 

> n 

J f*fc block of pixels and, in a typical example, is an 8 row by 8 column matrix. 

2 g w 5 [0056] The matrix S is the vertical subsampling matrix. For example, if each dimension 

O % % S 5 5 of the picture is cut in half when subsampling, and the block P has 8 rows, the matrix S 

§ £ s 3 £ a would be a 4 row by 8 column matrix. 

- g < | o h 

5 5 2 * < [0057] The matrix T 5 is the transpose of the matrix T. The matrix T is the horizontal 
subsampling matrix. For example, if each dimension of the picture is cut in half when 
subsampling, and the block P has 8 columns, the matrix T would be an 8 row by 4 column 
matrix. 
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[0058] The matrix {ml)' 1 is the multiplication inverse of matrix ml such that {ml)' 1 x ml 
= I where "7" is the identity matrix. Matrix ml equals S x (D" 1 )^, where (£> _1 )ieft is the left 
portion of the inverse of D. As an illustrative but non-limiting example of the dimension of 
ml, if S is a four row by eight column matrix and (D' l )\ t ft is an eight row by four column 
matrix, the matrix ml and the matrix {ml)' 1 are both four row by four column matrices. 
[0059] The matrix {nl)' 1 is the multiplication inverse of matrix nl such that {nl)' 1 xnl = 
I where "7" is the identity matrix. Matrix nl equals (TT 1 )^ x T\ As an illustrative but non- 
limiting example of the size of nl, if {K l ) iop is a four row by eight column matrix and T* is 
an eight row by four column matrix, the matrix nl and the matrix {hi)' 1 are both four row by 
four column matrices. 

[0060] The dimension of the resulting pre-multiplication matrix {ml)' 1 x S by which the 
matrix P is pre-multiplied according to equation 9 is obtained as follows. The number of 
rows in this matrix is equal to the number of rows in S and the number of columns in this 
matrix is equal to the number of rows in P (the latter typically being 8). 
[0061] The dimension of the resulting post-multiplication matrix T x {nl)' 1 by which 
the matrix P is post-multiplied according to equation 9 is obtained as follows. The number 
of rows in this matrix is equal to the number of columns in P (which is typically 8), and the 
number of columns is equal to the number of rows in T. 

[0062] To continue with the illustrative but non-limiting example of the dimensions of 
the matrices involved, in this example, equation 9 results in the pre-multiplication of an 
eight row by eight column matrix by a four row by eight column pre-multiplication matrix, 
and in the post-multiplication of the eight row by eight column matrix by an eight row by 
four column pre-multiplication matrix. In this example, the result is a four row by four 
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column matrix that constitutes the potential non-zero values of the eight row by eight 
column matrix a, 

[0063] Referring to Figure 4, after the subsample transcoding is completed for each 
block, variable length coding is performed on the blocks and then the transcoded video is 
either stored for future subsampled decoding, or the transcoded video is provided to the 
subsample decoder (act 406). In the context of Figure 3, the video management system 310 
may either store the video stream in memory 341, or else may provide the video stream to 
another component external to the video management system 310. 

[0064] Since the transcoded video stream is smaller after variable length coding than the 
original encoded video stream, less memory is required to store the video stream if the video 
stream is stored. Also, less network bandwidth is required to transmit the video stream if the 
video stream is transmitted over the network. According, the memory and network 
bandwidth needed to handle the video stream are reduced. The subsample transcoding 
described above would result in a loss of image quality if subsampled decoding was not to 
occur. However, if it is known that the video stream is to ultimately be subsample decoded, 
the video stream may be subsampled transcoded in accordance with the present invention 
with the assurance that the subsampled transcoding will result in no lost image quality after 
subsampled decoding. Accordingly, although some additional processing is required to 
perform the subsampled transcoding, the principles of the present invention allow for 
reduced memory and bandwidth requirements with no cost in terms of loss of video quality 
after subsampled transcoding. 

[0065] The present invention may be embodied in other specific forms without departing 
from its spirit or essential characteristics. The described embodiments are to be considered 
in all respects only as illustrative and not restrictive. The scope, of the invention is, 
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therefore, indicated by the appended claims rather than by the foregoing description. All 
changes which come within the meaning and range of equivalency of the claims are to be 
embraced within their scope. 

[0066] What is claimed and desired to be secured by United States Letters Patent is: 
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